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Abstract: We prove that functions over the reals computable in polynomial time can 
be characterised using discrete ordinary differential equations (ODE), also known 
as finite differences. We also provide a characterisation of functions computable 
in polynomial space over the reals. 

In particular, this covers space complexity, while existing characterisations were 
only able to cover time complexity, and were restricted to functions over the 
integers. We prove furthermore that no artificial sign or test function is needed 
even for time complexity. 

At a technical level, this is obtained by proving that Turing machines can be simu- 
lated with analytic discrete ordinary differential equations. We believe this result 
opens the way to many applications, as it opens the possibility of programming 
with ODEs, with an underlying well-understood time and space complexity. 
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1 Introduction 


Recursion schemes constitute a major approach to classical computability theory and, 
to some extent, to complexity theory. The foundational characterisation of FPTIME, 
based on bounded primitive recursion on notations, due to Cobham [16] gave birth to 
the field of implicit complexity at the interplay of logic and theory of programming. 
Alternative characterisations, based on safe recursion [1] or on ramification ([23, 22]) 
or for other classes [24] followed: see [14, 15] for monographs. 
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Initially motivated to help understand how analogue models of computations compare 
to classical digital ones, in an orthogonal way, various computability and complexity 
classes have been recently characterised using Ordinary Differential Equations (ODE). 
An unexpected side effect of these proofs is the possibility of programming with 
classical ODEs, over the continuum. It recently led to solving various open problems. 
This includes the proof of the existence of a universal ODE [10], the proof of the Turing- 
completeness of chemical reactions [17], or hardness of problems related to dynamical 
systems [19]. | While the above results are easy to state, their proofs are mixing 
considerations about approximations, control of errors, and various constructions to 
emulate continuously some discrete processes, despite some recent attempts for a kind 
of programming languages to help intuition [6]. 


Discrete ODEs, that we consider in this article, are an approach in-between born from 
the attempt of [8, 9] to explain some of the constructions for continuous ODEs in 
an easier way. The basic principle is, for a function f(x), to consider its discrete 
derivative defined as Af(x) = f(x + 1) — f(x) (also denoted f'(x) in what follows to 
help analogy with classical continuous counterparts). A consequence of this attempt is 
the characterisation obtained in [8, 9]. They provided a characterisation of FPTIME 
for functions over the integers that does not require the specification of an explicit 
bound in the recursion, in contrast to Cobham’s work [16], nor the assignment of a 
specific role or type to variables, in contrast to safe recursion or ramification [1, 21]. 
Instead, they only assume involved ODEs to be linear, a very classical natural concept 
for differential equations. A characterisation, like ours, happens to be rather simple 
using only common notions from the world of ODEs. In particular, considering linear 
ordinary differential equations is very natural for ODEs. 


Remark 1 Unfortunately, even if it was the original motivation, both approaches for 
characterising complexity classes for continuous and discrete ODEs are currently not 
directly connected. A key difference is that there is no simple expression (no analogue 
of the Leibniz rule) for the derivative of the composition of functions in discrete settings. 
The Leibniz rule is a very basic tool for establishing results over the continuum, using 
various stability properties, but similar statements cannot be established easily over 
discrete settings. 


In the context of algebraic classes of functions, the following notation is classical: call 
operation an operator that takes finitely many functions and returns some new function 
defined from them. Then [f1 , f2,- - - fk; OP1, 0P2, . - -, Ope] denotes the smallest set of 
functions containing f1, f2, . . -fg that is closed under the operations op,, opy,... Opo. 
Call discrete function a function of type f : Sı X -++ x Sa > S| X ...S!,, where 
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each S;, S; is either N or Z. Write FPTIME for the class of functions computable in 
polynomial time, and FPSPACE for the class of functions computable in polynomial 
space. 


The literature considers two possible definitions for FPSPACE, according to whether 
functions with non-polynomial size values are allowed or not. In our case, we should 
add “whose outputs remain of polynomial size”, to resolve the ambiguity. 


Remark 2 The point is that, otherwise, the class is not closed by composition. This 
may be considered as a basic requirement when talking about the complexity of func- 
tions. The issue is about the usage of not counting the output as part of the total space 
used. In this model, given f computable in polynomial space, and g in logarithmic 
space, f o g (and g of) is computable in polynomial space. But this is not true if we 
assume only f and g to be computable in polynomial space, since the first might give 
an output of exponential size. 


A main result of [8, 9] is the following (LDL stands for linear derivation on length): 


Theorem 1.1 ([8]) For functions over the reals, we have LDL = FPTIME where 
LDL = [0, 1, ak, Lx), +, —, x, Sg(x) ; composition, linear length ODE]. 


In particular, writing as usual B4 for functions from A to B, we deduce: 


Corollary 1.2 (Functions over the integers) LDL N NN = FPTIME NNN. 


That is to say, LDL (and hence FPTIME for functions over the integers) is the smallest 


class of functions that contains the constant functions 0 and 1, the projections 7* of 
the i coordinate of a vector of size k, the length function @(x), mapping an integer 
to the length of its binary representation, the addition x+y, the subtraction x—y, the 
multiplication x x y, the sign function sg(x) and that is closed under composition (when 
defined) and linear length-ODE scheme: the linear length-ODE scheme, formally given 
by Definition 2.4, corresponds to defining a function from a linear ODE with respect 
to derivation along the length of the argument, so of the form 

f(x, y) 

ol 


= A[f(x, y), x, y] -f@, y) + BIf(@, y), x, y]. 
Here, we use the notation — which corresponds to the derivation of f along the 
length function: given some function £ : N’*+! — Z and in particular for the case 
where L(x, y) = &(x), 

(1-1) f(x,y) _ fx, y) 
OL OL(x,y) 


= h(f(@, y), x,y) 
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is a formal synonym for 


fœ + 1,y) =f@,y)+(L@+4 1,y) — Lx, y)) - h€, y), x,y). 


Remark 3 This concept introduced in [8, 9], is motivated by the fact that the latter 
expression is similar to the classical formula for continuous ODEs: 


of(x,y) _ Ly) fx,y) 
ôx ôx ÔL(x, y) 


and hence is similar to a change of variable. Consequently, a linear length-ODE is a 
linear ODE over a variable ¢ once the change of variable t = @(x) is done. 


However, in the context of (classical) ODEs, considering functions over the reals 
is more natural than only functions over the integers. Call real function a function 
f :S, x +++ x Sq — S| X... Sy, where each S;, S; is either R, N or Z. A natural 
question about the characterisation of FPTIME for real functions arises, and not only 
discrete functions: we consider here computability over the reals in its most classical 
approach, namely computable analysis [30]. 


As a first step, the class 


H 


DEL’ = [0,1, ak, &(x), +, —, x, cond(x), 5; composition, linear length ODE] 


has been considered in [3, 2] where the authors get some characterisation of PTIME, 
but only for functions from the integers to the reals (i.e. sequences) while it would be 
more natural to characterise functions from the reals to the reals. 


More importantly, this was obtained by assuming that some non-analytic exact func- 
tion is among the basic available functions to simulate a Turing machine: cond valuing 
1 for x > ; and 0 for x < i. 


We prove first this is not needed, and mainly, we extend all previous results to real 
functions, furthermore covering not only time complexity but also space complexity. 
Consider 

xX x. 
23 
where £ : N — N is the length function, mapping some integer to the length of its 
binary representation, 5 : R — R is the function dividing by 2 (similarly for 3) and all 
other basic functions defined exactly as for LDL, but are considered here as functions 


LDL’? = [0,1, TÉ, &(x), +, —, tanh, composition, linear length ODE}, 


from the reals to reals. 
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Remark 4 This class is LDL but without the sg(x) function, nor the multiplication 
function, or LDL’ but without the cond function, nor the multiplication. This is done 
by adding the analytic tanh functions as a substitute (and adding x/3). 


Remark 5 We can consider N C Z C R but as functions may have different types 
of outputs, the composition is an issue. We consider, as this is done in [3, 2], that 
composition may not be defined in some cases: it is a partial operator. For example, 
given f : N > R and g : R > R, the composition of g and f is defined as expected, 
but f cannot be composed with a function such as h : N —> N. 


First, we improve Theorem 1.1 by stating FPTIME over the integers can be charac- 
terised algebraically using linear length ODEs and only analytic functions (i.e. no need 
for sign function). Since LDL” is about functions over the reals, and Theorem 1.1 is 
about functions over the integers, we need a way to compare these classes. Given a 
function f : R > R” sending every integer n € Nf to the vicinity of some integer 
of Nf, say at a distance less than 1/4, we write DP(f) for its discrete part: this is the 
function from N? > N” mapping n € Nf to the integer rounding of f(n). Given 
a class C of such functions, we write DP(C) for the class of the discrete parts of the 
functions of C. 


Theorem 1.3 DP(LDL°) = FPTIME N NN, 


Second, we improve [3]: Write LDL” for the class obtained by adding some effective 
limit operation similar to the one considered in [3] to get LDIL*. We get a character- 
isation of functions over the reals (and not only sequences as in [3]) computable in 


polynomial time. 


Theorem 1.4 (Generic functions over the reals) LDL° N RÈ = FPTIMEN RË 


More generally: LDL? N RNR” _ FPTIME A RNxR* 


We also prove that, by adding a robust linear ODE scheme (Definition 2.7), we get a 
class RLD” (this stands for robust linear derivation) with similar statements but for 
FPSPACE. 


Theorem 1.5 DP(RLD°) = FPSPACE N NN. 


Theorem 1.6 (Generic functions over the reals) RLD° N R® = FPSPACE N R? 
More generally: RLD? N RN’x®* = FPSPACE N RN’xR’, 
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As far as we know, this is the first time a characterisation of FPSPACE with discrete 
ODEs is provided. If we forget the context of discrete ODEs, FPSPACE has been 
characterised in [29] but using a bounded recursion scheme, i.e. requiring some explicit 
bound in the spirit of Cobham’s statement [16]. We avoid this issue by considering 
numerically stable schemes, which are very natural in the context of ODEs. 


At a technical level, all our results are obtained by proving Turing machines can be 
suitably simulated with analytic discrete ODEs. We believe our constructions could be 
applied to many other situations, where programming with ODEs is needed. 


More on related works In addition to the previous state-of-the-art discussions, we 
comment here on some aspects: Our ways of simulating Turing machines have some 
reminiscence of similar constructions used in other contexts such as Neural Networks 
[28, 27]. In particular, we use a Cantor-like encoding set Z with inspiration from 
these references. These references use some particular sigmoid function o (called 
sometimes the saturated linear function) that values 0 when x < 0, x forO <x < 1, 
1 for x > 1. The latter is equivalent to cond(} + $x), for the function considered in 
[3, 2] and hence their constructions can be reformulated using the cond function. We 
completely avoid this, by considering the tanh function, which is more natural in the 
context of formal neural networks. The models considered in [28, 27] are recurrent, 
while our constructions are not recurrent neural networks, and second, their models 
are restricted to live on the compact domain [0,1], which forbids getting functions 
from R — R, while our settings allow more general functions. Our proofs also require 
functions taking some integer arguments, that would be impossible to consider in their 
settings (unless at the price of an artificial recoding). 


Remark 6 In some sense, our constructions can be seen as operators that map to a 
family of neural networks in the spirit of these models, instead of considering fixed 
recurrent neural networks, but also dealing with tanh, and not requiring the saturated 
linear function. 


The question of whether Turing machines can be simulated by recurrent neural networks 
of finite size, using the tanh activation function (and not the “exact” saturated linear 
function) is a well-known long-standing open problem. Despite some attempts, such 
as the one described in [27], up to our knowledge, there remains some of the statements 
not yet formally proved, or at least not fully generally accepted, in the existing proofs. 
Our statements are dealing with the tanh activation function, in some sense, but we 
avoid this open question by restricting it to finite space or time computations. By the 
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way, our proofs state this is possible if the space or the time of the machine is bounded, 
up to some controlled error. 


In the context of neural network models, there have been several characterisations of 
complexity classes over the discrete (see e.g. the monograph [27] about the approach 
discussed above, but not only), as far as we know, the relation between formal neural 
networks with classes of computable analysis has never been established before. 


If we do not restrict ourselves to neural network-related models, as in all these previous 
contexts, as far as we know, only a few papers have been devoted to characterisations of 
complexity, and even computability, classes in the sense of computable analysis. There 
have been some attempts using continuous ODEs [7], that we already mentioned, or 
the so-called IR-recursive functions [11]. For discrete schemata, we only know [12] 
and [25], focusing on computability and not complexity. 


Organisation of the article In Section 2, we recall some basic statements about the 
theory of discrete ODEs. More details can be found in Appendix B, mostly repeated 
from [8, 9, 3], to be self-content. In Section 3, we establish some properties about 
particular functions required for our proofs. In Section 4 we prove our main technical 


result: Turing machines can be simulated using functions from LDL°. Section 5 is 
about converting integers and reals (dyadic) to words of a specific form. Section 6 is 
about applications of our toolbox. We prove in particular all the above theorems. 


About appendices In this article, when we say that a function f : $1 x---x S4 —> R| 
is (respectively: polynomial time or space) computable this will always be in the sense 
of computable analysis: see e.g. [13, 30]. As we did not find a reference where the 
case of functions mixing integer and real arguments is fully formalised, we proposed a 
formalisation in [3]. In order to be self-content, we repeat it in Appendix A. In order 
not to expect our readers to be familiar with the theory of discrete ODEs, as we already 
wrote, we repeat some basic statements in Appendix B, mostly repeated from [8, 9]. 
As we need some results from [3], we also repeat their proof in Appendix C. 


Note about current version vs final version This article is the journal version of an 
article accepted at MFCS’2023 [4]. 
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2 Some concepts related to discrete ODEs 


In this section, we recall some concepts and definitions from discrete ODEs, either 
well-known or established in [8, 9, 3]: more details, repeated from these references, 
are provided in Appendix B. 


In order to get a uniform presentation, we consider here that tanh is tanh, the hyperbolic 
tangent. The papers [8, 9] use similar definitions with the sign function sg and [3] with 
the piecewise affine function cond, which values 1 for x > ; and 0 for x < 1, instead 
of tanh. 


Definition 2.1 ([3]) A tanh-polynomial expression P(x1,...,Xņp) is an expression 
built-on +, —, x (often denoted -) and tanh functions over a set of variables V = 
{x1, ...,x,} and integer constants. 


We need to measure the degree, similarly to the classical notion of degree in a poly- 
nomial expression, but considering all subterms that are within the scope of a tanh 
function contributes to 0 to the degree. 


Definition 2.2 ([3]) The degree deg(x, P) of aterm P in x € V is defined inductively 
as follows: 


e deg(x,x) = 1 and for x’ € V U Z such that x’ 4 x, deg(x, x’) = 0; 
e deg(x,P + Q) = max{deg(x, P), deg(x, Q)}; 

e deg(x,P x Q) = deg(x, P) + deg(x, Q); 

e deg(x, tanh(P)) = 0. 


A tanbh-polynomial expression P is essentially constant in x if deg(x, P) = 0. 


A vectorial function (resp. a matrix or a vector) is said to be a tanh-polynomial 
expression if all its coordinates (resp. coefficients) are, and essentially constant if all 
its coefficients are. 


Definition 2.3 ([8, 9, 3]) A tanh-polynomial expression g(f(x, y), h(x, y),x,y) is 
essentially linear in f(x,y) if it is of the form: A[f(x, y), h(x, y), x,y] - f(x,y) + 
B[f(x, y), h(x, y),x, y] where A and B are tanh-polynomial expressions essentially 
constant in f(x, y). 


For example, 
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e the expression P(x, y,z) = x- tanh (x* — z)- y + y? is essentially linear in x, 
essentially constant in z and not linear in y. 

e The expression P(x, 2°) 2) = cond(x? — z) - z? + 20) is essentially constant 
in x, essentially linear in 2“ (but not essentially constant) and not essentially 
linear in z. 


e The expression: z+ (1 — tanh(x))- (1 — tanh(—x)) - (vy — z) is essentially constant 
in x and linear in y and z. 


Definition 2.4 (Linear length ODE [8, 9]) A function f is linear length-ODE de- 
finable from u essentially linear in f(x,y), g and h if it corresponds to the solution 
of 


Of 
(2-1) fO,y) =g(y) and ane = u(f(x, y), h(x, y), x, y). 


A fundamental fact is that the derivation with respect to length provides a way to do 
some change of variables: 


Lemma 2.5 ([8, 9]) Assume that (2-1) holds. Then f(x,y) is given by f(x,y) = 
F(x), y) where F is the solution of the initial value problem 


OE, y) 
(2-2) F0, y) =g), and —,——=w(@,y), h(2' — 1, y), 2° — 1,y). 
This means f(x, y) depends only on the length of its first argument: f(x, y) = f(2°,y). 
Then (2-2) can be seen as defining a function (with this latter property) by a recurrence 
of type 


(2-3) f(2°,y)=g(y), and f£(2'*',y) = UE, y), h — 1,y),2',y). 


for some U is essentially linear in f(2', y). As recurrence (2-2) is basically equivalent 
to (2-1): 


Corollary 2.6 (Linear length ODE presented with powers of 2) A function f is 
linear £-ODE definable iff the value of f(x,y) depends only on the length of its first 
argument and satisfies (2-3), for some g and h, and U, essentially linear in f(2', y). 


We guess it is easier for our reader to deal with recurrences of the form (2-3) than with 
ODEs of the form (2—1). Consequently, this is how we will describe many functions 
from now on, starting with some basic functions, authorising compositions, and the 
above schemes. As an example, ++ 2” can easily be defined that way. Consider: 


2 = 
grr = 2-27 = 9" 49" 
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Similarly, we can produce n ++ 2?) for any polynomial p. For example, 
(ny,..., np) > QM 


can be obtained, using k such schemes in turn, providing the case of the polynomial 
p(n) = nk, 
When talking about space complexity, we will also consider the case where the ODE 


is not derivated with respect to length but with classical derivation. For functions over 
the reals, an important issue is numerical stability. 


Definition 2.7 (Robust linear ODE [8, 9]) A bounded function f is robustly linear 
ODE definable from u essentially linear in f(x, y), g and h if: 


(1) it corresponds to the solution of 


Of 
(2-4)  £(,y)= ely) and oy) = u(f(x, y), h(x, y),x,y), 


(2) where the schema (2-4) is polynomially numerically stable. 


Here, writing a =, b for ||a — b|| < 2~” for conciseness, 2. means formally there 
exists some polynomial p such that, for all integer n, writing e(n) = p(n + &(y)), if 
you consider any solution of 


y =en) Y 
h(x, ý) =en hQ, ¥) 
f0,9) =en gY) 
WD Sen uE, Y) ie, Y), x Y) 
then 
f(x, 9) =en) f, y). 


This corresponds roughly to the concept of polynomially robust to precision considered 
in [5], and turns out to be a very natural concept for functions defined on a compact. 


Remark7 Forlinear length ODEs, we did not have to put explicitly numerical stability 
as a hypothesis, as it comes free from the fact that we consider solutions at most at 
some logarithmic value of their arguments. But this is required here to guarantee the 
computability of the solution (and even polynomial space computability). 


Remark 8 Notice that, over the continuum, even computable ODEs may have no 
computable solution [26]. Over the discrete, not all dynamics can be simulated, and 
numerical stability is indeed an issue. 
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Remark 9 We believe the hypothesis that f is bounded can be relaxed to: “the 
function does not grow as much as the exponential of a polynomial in the length of its 
arguments”, i.e. not more than a function of FPSPACE, from arguments similar to the 
ones of [29] about functions over the integers. 


3 Some results about various functions 


A key part of our proofs is the construction of very specific functions in LDL°: we 
write {x} for the fractional part of the real x, i.e. {x} =x — |x]. 


A first observation is that we can uniformly approximate the ReLU(x) = max(0, x) 
function using an essentially constant function: 


Lemma 3.1 Consider (see Figure 1) 


1 + tanh(2”+?x) 


Y(x, IMEZ == 5 


For all integer m, for all x € R, 
| ReLU(x) — xY (x, 2” T] < 2”, 


Figure 1: Graphical representation of xY (x, 27+?) obtained with maple. 


To prove Lemma 3.1, we start by the following basic facts about function tanh. 


Lemma 3.2 |1 + tanhx| < 2 exp(—2|x|) for x € (—co, OJ. 
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Proof For x < 0, |1 + tanhx| = 1 + tanhx, and |x| = —x. We have f(x) = 
2 exp(2x) — tanh(x) — 1 = 2 exp(2x) mess t= en >0. 


Lemma 3.3 |1 — tanh x| < 2 exp(—2|x|) for x € [0, +00). 


Proof This follows from Lemma 3.2, using the fact that tanh is odd. 


Proof of Lemma 3.1 Let m € N. Consider Y(x, K) = 2E | with K > 0. 


For 0 < x, ReLU(x) = x, and | ReLU(x)—x¥(x, K)| = 3|1—tanh(Kx)| < xexp(—2Kx) 
from Lemma 3.3. 

For x < 0, ReLU(x) = 0, and |ReLU(x) — xY(x,K)| = Pliy + tanh(Kx)| < 
|x| exp(—2K |x|) from Lemma 3.2, which is the same expression as above for 0 < x. 
Function g(x) = xexp(—2Kx) has its derivative g'(x) = exp(—2Kx)(1 — 2Kx). We 
deduce the maximum of this function g(x) over R is in 5K? and that the maximum 


value of g(x) is aK: 


Consequently, if we take K = 2+2 then g(x) < 27” for all x, and we conclude. 


We deduce we can uniformly approximate the continuous sigmoid functions (when 


1/(b — a) is in LDL?) defined as: s(a,b,x) = 0 whenever w < a, ;—“ whenever 


a<x<b,and 1 whenever b < x. 


Lemma 3.4 (Uniform approximation of any piecewise continuous sigmoid) As- 


sume a,b, ro is in LDL°. Then there is some function (illustrated by Figure 2) 
C-s(z,a,b,x) E€ LDL? such that for all integer m, 


| C-5(2", a, b,x) — s(a,b,x)| < 27”. 


Proof We can write s(a,b,x) = ee Consider C-s(z,a,b,x) = 
Y : gite x—b)Y' b, Qi+e 2.2-m—-1-¢ 
(x—a) ¥(x—a,z m WYa— b2 °) Thus, | C-s(2™+1+c, a, b,x) — s(a, b, x)| < 22, 


using the triangle inequality. Take c such that -n <2" 


The existence of the following function will play an important role to obtain some 
various other functions. 


Theorem 3.5 There exists some function (illustrated by Figure 3) € : NÑ? > R in 
LDL” such that for all n,m € N and x € [—2",2"], whenever x € [|x| + ł, lx] +7, 


ke,» — fx — 7 LD 
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Figure 2: Graphical representation of C-s(2, 5, 3.x) and C-s(2°, 5, 3x) obtained with maple. 


Proof If we take €’ that satisfies the constraint only when x > 0, and that values 0 
for x < 0, then 3 — €'(-,+, —x) would satisfy the constraint when x < 0, but values 
3/4 for x > 0. So, 

3 
4 
would work for all x. So it remains to construct €’ such that for all n € N, x € [0,2”] 
and m € N, whenever x € [|x] + $, |x] + 2], |€(2,2",x) — {x — g}| < 27”, and 
|€(2",N,x) — 0] < 27™ for x < 0. 


1 
E2”, N, x) = CO" N, x) ~~ CON, =x) T ; a C-s(2*?, 0, p” 


Let s(x) = ġ s(lx] + $, |x| + 4,x). Over [|x] +4, |x] + £], we have s(x) = {x — $}. 
Actually, we will even construct €’ with the stronger properties that whenever x € 
El] +4 27, [x] + g2], E, 2", — s(x — | S20". 
It suffices to define €’ by induction by 

E", 20? x) = 3 C-s(2”, Ł, 4, x) 

O82" x) = CO 28 FO 2, x)) 


where 


1 3 
FQ”+! BD Hk RCO K + api + zo 


Let Ix] be [|x] 4 t, lx] +4], x € I x» and let us first study the value of F(2”"+!, 2”, x): 


e If x < 2", by definition of C-s, |F(2"t!, 2", x) — x| <2-@*”, with x € Tx]. 
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VI 


0.64 


Figure 3: Graphical representations of €(2,4,x) obtained with maple: some details on the 
right. 


e Tf 2"4+ 1 <x then [F(2"+1, 2", x) — (@—2”)| < 2-4) with x— 2" € lyja. 


Now, the property is true by induction. Indeed, it is true for n = 0 by definition of 
€/(2™, 2°, x). We now assume it is true for some n € N. We have €/(2",2"t!, x) = 
E(am™t! 2" F(2™+! 2” x)). Thus, by induction hypothesis, |¢/(2”+!, 2”, F(2™+!, 2”, x))— 
s(F(2™*1, 2" x) — 1/8)| < 2-@*, Now: 


e If x < 2”, by definition of C-s, |F(2”+!, 2”, x) — x| < 27+), and as s is 1- 
Lipschitz, |s(F(2"*!, 2”, x) — $) — s(x — 3)| < |FQ2"*1,2",x) — x| < 27D, 
Consequently, |¢’(2”,2"*!, x) — s(x — D) < 2~” and the property holds for 
n+l. 


e Tf 2"+3 < x then |F(2”+!, 2”, x) — (x—2")| < 270+D and as s is 1-Lipschitz, 
|s(F(2"*1, 2", x) — $) — sœ — 2" — E| < JEH, 2, x) — x 2| S27, 
Consequently, |€’(2”, 2"*!, x) — s(x — 2” — D| < 2™™ and the property holds 
forn+ 1. 


There remains to prove that the function €’ is in LDL°. Unfortunately, this is not 
clear from the recursive definition, but this can be written in another way, from 
which this follows. Indeed, we have from an easy induction that €/(2”,2",x) = 
FQ! 20 POM 2 ol pant 2? (..., F(2”,2”7!, x))))), if we define 


17 
gy gA 


FORD 3) = CO" 22) = CW" 7, 22) 
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Figure 4: Graphical representation of €|(2,4,x) and &(2,4, x) obtained with maple. 


Then, we can obtain €/(2”",2”,x) = H(2”,2"—!, 2”, x) with 
He" 2) 2 = Fe 2” x) 
AO 2 Sto 2 HO To 
= H(2” 2t gn x) at pao gn 1-t gn 1-t 


1 
=F g H2", 2,2", x) 


Such a recurrence can be then seen as a linear length ordinary differential equation, in 
the length of its first argument. It follows that €’ is in LDL”. 


From the construction of the previous functions, we obtain a bestiary of various func- 
tions 


Corollary 3.6 There exists (see Figure 4) €,& : N? x Rts R € LDL? such that, 
for alln,m € N, |x| € [-2"+1,2"], 


e whenever x € [|x| — 5, |x| + 4], 
|E (2”, 2”, x) = {x}| < 28, 
e and whenever x € [|x], |x| + 31, 


[E(2", 2”, x) — {x} eo, 
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Figure 5: Graphical representation of 0)(2,4,x) and o2(2,4,x) obtained with maple. 


Proof Consider 4 i 
E2”, N, x) = E2”, N,x = 3) = 7 


and 


7 
E(2”, N, x) = &(2",N,x = 3) 


Corollary 3.7 There exists (see Figure 5) 01,02: N? x Rt» R € LDL? such that, 
for alln,m € N, |x| € [-2"+ 1,2"), 


e whenever x € [|x| — 4, |x] + 4]. 
eye 2) = |x]| <27”, 
= 3 
e and whenever x € h = [|x], |x| + 41 


|oo(2™, 2”, x) — [x]| < 27. 


Proof Consider o;(2”, x) = x—€;(2”, x) with the function defined in Corollary 3.6. 


Corollary 3.8 There exists (see Figure 6) \ : N? x R > [0,1] € LDL? such that 
for all m,n € N, |x] € [—2"+1,2"], 


e whenever x € [|x| + 4, |x] +5]. 


q> 
|A(2”™, 2) ~~ 0| < a 
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Figure 6: Graphical representation of A(2, 4, x) obtained with maple. 


e and whenever x € [|x] 4 3, |x! + 1], 


AR”, 2 x) 4) a, 


Proof Consider \(2”, 2”, x) = F(€(2™*!, 2”, x — 9/8)) where 
F@) = C-s(2”*! 1/4, 1/2,3). 


Corollary 3.9 There exists (see Figure 7) modz : N? x Re [0,1] € LDL? such 
that for all m,n € N, |x| € [—2" + 1,2”], whenever x € [|x] — 4, |x] + 4]. 


—1i, 
| mod,(2”, 2”, x) — |x| mod 2| < 27”. 
Proof We can take 
1 7 
mod2(2”, N, x) = 1 — A(2”, N /2, 5* + —) 


8 
where A is the function given by Corollary 3.8. 


Corollary 3.10 There exists (see Figure 8) +2 : N? x Re [0,1] EL 


L° such that 


for all m,n € N, |x| € [-2" + 1,2"], whenever x € [|x] — $, [x] + 41. 
| +2 (2™”, 2", x) T [x] //2| £ 2, 


where // is the integer division. 
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Figure 7: Graphical representation of mod2(2,4,x) obtained with maple. 


Proof We can take 


1 
22" N, x) = z OIQ",N, x) = mod(2”, N, x)) 


where mod is the function given by Corollary 3.9, and o» is the function given by 


Corollary 3.7. 


3.1 Some properties of sigmoids 


Observing that for function 


if(d, © = 45(1,2,1/2+d+ 2/4) — 


for £ € [0,1], we have = 
if0,4) = 0 
if(1, £) 


II 
a 


Applying Lemma 3.4 on this sigmoid, we get: 


Lemma 3.11 There exists C-if € LDL? such that, for £ € [0,1], 


e if we take |d' — 0| < 1/4, then |C-if(2”,d', 2) — 0| < 27”, 
e and if we take |d’ — 1| < 1/4, then |C-if(2”, d’, 2) — 4| < 27”. 
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Figure 8: Graphical representation of +2(2,4, x) obtained with maple. 


We start by proving the following lemma, which is about some ideal sigmoids: 
Lemma 3.12 For £ € [0,1], for —1/4 < d < 1/4, if(d,® = 0, and for 3/4 < d < 
5/4, if(d, D) = 4d — 1) + £. 
Consider if(d, © = if(s(1/4,3/4,x), 2. 

e If we take |d' — 0| < 1/4, then if(d’, 2 = 0, 

e and if we take |d’ — 1| < 1/4, then if(d’, 0) = £. 


Proof Just check that ford < 1/4, wehave 1/2+d+¢/4 < 1,andhence s(1,2,1/2+ 
d+£/4) =0, so if(d, 2) = 0. For 3/4 < d < 5/4, we have 5/4 < 1/2+d+¢/4 < 2, 
and hence s(1,2, 1/2 +d + £/4) = d + £/4 — 1/2, and hence if(1, £) = £. The other 
observation follows. 


Then we go to versions using tanh: 
Lemma 3.13 Consider C-if(2”,d, D = 4C-s(2”+2,1,2,1/2 + d + 1/4) — 2. For 
£ € [0,1], we have |C-if(0, 2) — 0| < 2™, and |C-if(1, 2) — £| < 2™. 


For —1/4 < d < 1/4, |C-if(Q2",d,Q — 0| < 2™, and for 3/4 < d < 5/4, 
|C-if(2”,d, 2) — 4(d — 1) + 4| < 2™. 


Consider C-if(2”, d, 0) = C-af(2"*!, C-5(2"+1, 1/4,3/4,x), 0). 
e If we take |d’ — 0| < 1/4, then |C-if(d’, O — 0| < 27”, 
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e and if we take |d’ — 1| < 1/4, then |C-if(d’, 2) — 4| < 27". 


Proof This is direct from previous lemma and Lemma 3.4. 


Then Lemma 3.11 follows. 


Lemma 3.14 Let œi, @2,..., œn be some integers, and V1, V2,...,V, some con- 
stants. There is some function in LDL’, that we write C-send(2”, aj œ> Viie{i,....n}> 
that maps any x € [a; — 1/4,a; + 1/4] to a real at a distance at most 2~” of V;, for 
alli € {1,...,n}. 


We prove the following lemma about some ideal sigmoids: define cond(x) as 5(1/4, 3/4, x) 
and C-cond(2”, x) as C-s(2”, 1/4,3/4, x). 


Lemma3.15 Assume you are given some integers a1, 02,..., Qn, and some constants 
V1, V2, .--, Vn. Then there is some function in LDL? , written send(a; ++ Vi)ie{1,....n}> 
that maps any x € [a;i — 1/4,a; + 1/4] to Vj, for alli € {1,...,n}. 


Proof Sort the a; so that aj < ay < ...,@n. Then consider Tı + cond(x — a1)(T2 — 
T1) + cond(x — a2)(T3 — E3) +--+ + cond(x — an—1)(Tn — Tn-1). 


We now go to versions with tanh. 


Proof of Lemma 3.14 Sort the a; so that aj < az <...,a@,. Then consider Tı + 
C-cond(2” +° x—a1)(T2 —T) +C-cond(2"*,, x—a2)(T3 — T2) +: -- +C-cond(2™**, x— 
Qn—1)(Tn — T,—1). for some constant c so that n max;(T; — Tj41) < 2°. 


More generally: 


Lemma 3.16 Let N be some integer. Let a, @2,...,@n be some integers, and V; ; 
for 1 < i < n some constants, with O < j < N. Then there is some function 
in LDL, that we write C-send(2”, (ai, j) > Vijie{i,....n} j€{0,...,N—1}> that maps any 
x € [aj — 1/4,a; + 1/4] and y € [j — 1/4,j + 1/4] to a real at a distance at most 2~" 
of Vij, for alli € {1,...,n},7 € {0,...,N—1}. 


We start by proving the following lemma talking about some ideal sigmoids: 
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Lemma 3.17 Let N be some integer. Assume some integers 1,02,...,Q,, and 
some constants V;; for 1 <i <n, and 0 <j < N are given. Then there is some 
function in LDL’, that we write send((a;,j) > Vijie{i,....n} jE{0,...,N—-1}> that maps 
any x € [aj — 1/4,a; + 1/4] and y € [j — 1/4,j + 1/4] to Vij, for alli € {1,...,n}, 
J €{0,...,N— 1}. 


Proof If we define the function 


send((ai,j) 4 Vij ie (1,....n} je {1,...N} Os Y) 


by send(Na; +j > Vijiett,....n}je{i,....N}Nx + y) this works when x = a; for 
some i. Considering instead send(Na; + j +> Vi,)ic{1,....n} jef1,...,v} (N send(a; +> 
Qi )ie{1,...n}%) + y) works for any x € [aj — 1/4, a; + 1/4]. 


We then go to versions with tanh: 


Proof of Lemma 3.16 If we define the function 


C-send(2”, (ai, j) = Visdie(i,...n} ge{1-.-.N} OY) 


by C-send(2”, Na; +j > Vijie(,....n} jefl,....N}Nx + y) this works when x = a; for 
some i. Considering instead 


C-send(2”, Nai +j > Vi Diegi, n} Jef l,....N}N C-send(2”"*°, ai aiei, pOH) 


that works for any x € [a;i — 1/4, ai + 1/4], for some constant c selected as above. 


4 Simulating Turing machines with functions of LDL? 


This section is devoted to the simulation of a Turing machine using some analytic 
functions and in particular functions from LDL? . We use some ideas from [3] but with 
several improvements, as we need to deal with errors and avoid multiplications. 


Consider without loss of generality some Turing machine M = (Q, {0, 1,3}, dinit, ô, F) 
using the symbols 0, 1,3, where B = 0 is the blank symbol. 


Remark 10 The reason for the choice of symbols 1 and 3 will be made clear later. 
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We assume Q = {0,1,...,|Q| — 1}. Let ...l-gl-k+1 - . . l-iloror1 . .-rn.... denote 
the content of the tape of the Turing machine M. In this representation, the head is 
in front of symbol ro, and l;,r; € {0,1,3} for all i. Such a configuration C can be 
denoted by C = (q,/,r), where /,r € ©“ are words over alphabet © = {0,1,3} and 
q E€ Q denotes the internal state of M. Write: Ywora : XY — R for the function that 
maps a word w = wowiw2... to the dyadic Yword(w) = ee, wpA Ot) | 


The idea is that such a configuration C can also be encoded by some element C = 
(qyl, r € Nx R’, by considering F = Ywora(r) and l= Yword(). In other words, we 
encode the configuration of a bi-infinite tape Turing machine M by real numbers using 
their radix 4 encoding, but using only digits 1,3. Notice that this lives in Q x [0, 1]?. 
Denoting the image of Ywora : X” —> R by Z, this even lives in Q x T. 


Remark 11 Notice that Z is a Cantor-like set: it corresponds to the rational numbers 
that can be written using only 1 and 3 in base 4. We write Zs for those with at most 
S digits after the point (i.e. of the form n/4° for some integer n). 


A key point is to observe that 


Lemma 4.1 We can construct some function Next in LDL? that simulates one step 
of M: given a configuration C, writing C' for the next configuration, we have for all 
integer m, |\Next(2”,C) — C'|| < 2. 


Proof We can write / = lọl° and r = ror° , where lo and rọ are the first letters of / and 
r, and l and r° corresponding to the (possibly infinite) word l—ıl—2... and rir2... 
respectively. 


l r 

The function Next is of the form Next(q, l, r) = Next(q, llo, ror°) = (q', l’, r’) defined 
as a definition by case: 

Cis (q', L lox,r°) whenever 6(q, ro) = (q',x, >) 

pee (q', l, loxr*) whenever 6(q, ro) = (q',x, —) 
This provides a first candidate for the function Next: Consider the similar function 
working over the representation of the configurations as reals, considering ro = | 47| 

Next(q, 1,7) = Next(q,l*lo, ror*) = (q', l, r") 
= (Ce I*lox, r°) whenever oq, ro) = (d, x, —>) 

7 dq‘, p Ioxr*) whenever CE ro) = Ce x, +) 
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(4-1) B _ 
e in the first case “3”: V = 4-'1+4-!x andr’ = r° = {4r} 
e in the second case “4—” : V = T° = {41} and rf = 4-7{47} +4-7x + |41| /4 


We introduce the following functions: —: Q x {0,1,3} + {0,1} and +: Q x 
{0,1,3} +5 {0,1} such that — (q,a) (respectively: < (q,a)) is 1 when 6(g,a) = 
(-,-,—>) (resp. (-,-,<-)), i.e. the head moves right (resp. left), and O otherwise. 
We define nextgi = q' if 6(q,a) = (q',-,-), i.e. values (q',x,m) for some x and 
me{+,—}. 


We can rewrite Nexi(q, 1,7) = (q',1,7’) as 


= l - 
Vey. > (q, ro) (5 + *) + & (q,70) {al} 


qro 


and 


ENCER] 


q,ro 
and, using notation of Lemma 3.16, g’ = send((q, r) > NeXt} )gcO,re 0, 1,3} |4r]). 


Our problem with such expressions is that they involve some discontinuous functions 
such as the integer part and the fractional part function, and we would rather have 
analytic (hence continuous) functions. However, a key point is that from our trick of 
using only symbols 1 and 3, we are sure that in an expression like | 47|, either it values 
O (this is the specific case where there remain only blanks in r), or that 47 lives in an 
interval [1,2] or in interval [3, 4]. 


That means that we could replace |47| by o(47) if we take o as some continuous 
function that would be affine and values respectively 0, 1 and 3 on {0} U [1,2] U 
[3,4] (that is to say matches |47| on this domain). A possible candidate is a(x) = 
6(1/4,3/4,x) + 5(9/4, 11/4,x). Then considering €(x) = x — a(x), then €(4r) would 
be the same as {47}: that is, considering ro = (47), replacing in the above expression 
every {4-} by &(-), and every |-| by o(-), and get something that would still work the 
same, but using only continuous functions. 


But, we would like to go to some analytic functions and not only continuous functions, 
and it is well-known that an analytic function that equals some affine function on some 
interval (e.g. on [1,2]) must be affine, and hence cannot be 3 on [3, 4]. But the point is 
that we can try to tolerate errors, and replace s(-,-) by C-5(2**, -, -) in the expressions 
above for o and £, taking c such that (3 + 1/47)3|Q| < 2°. This would just introduce 
some error at most (3 + 1/47)3|Q|2~°2-™ < 27”. 


24 M. Blanc and O. Bournez 


Remark 12 We could also replace every — (q, r) in above expressions for T and 7 by 
C-send(k, (q,r) ~— (q,7r))(q, o(4F)), for a suitable error bound k, and symmetrically 
for + (q,r). However, if we do so, we still might have some multiplications in the 
above expressions. 


The key is to use Lemma 3.11: we can also write the above expressions as 
T=5,, [Cif (2"*,C-send(2?, (4,1) >> GG, 047), $ + 3) 
+ C-if (27e, C-send(2?, (q, r) > (q, r)\(q, a(4r)), &(41)) | 
r= Yi, [Cf (2"+°, C-send(2?, (q,r) > (q, Dq, 7(47)), €(47)) 


+ Cif (20+, C-send(2?, (q,r) >e (q, DX, 0147), GP + g + D) 


and still have the same bound on the error. o 


Once we have one step, we would like to simulate some arbitrary computation of a 
Turing machine, by considering the iterations of function Next. 


The problem with the above construction is that even if we start from the exact encoding 
C of a configuration, it introduces some error (even if at most 27™). If we want to 
apply again the function Next, then we will start not exactly from the encoding of a 
configuration. Looking at the choice of the function ø, a small error can be tolerated 
(roughly if the process does not involve points at a distance less than 1/4 of Z), but this 
error is amplified (roughly multiplied by 4 on some component), before introducing 
some new errors (even if at most 2~”). The point is that if we repeat the process, very 
soon it will be amplified, up to a level where we have no true idea or control about 
what becomes the value of the above function. 


However, if we know some bound on the space used by the Turing machine, we 
can correct it to get at most some fixed additive error: a Turing machine using a 
space S uses at most S cells to the right and to the left of the initial position of its 
head. Consequently, a configuration C = (q,l,r) of such a machine involves words 
l and r of length at most S. Their encoding l, and F are expected to remain in 
Ts11. Consider rounds,,() = |4°*!7|/45+!. For a point 7 of Zs41, 45+! is an 
integer, and / = rounds, (J). But now, for a point I at a distance less than 4—S+2) 
from a point I € Zs,1, rounds,,(/) = 1. In other words, rounds, “deletes” errors 
of order 4~“S+?), Consequently, we can replace every / in the above expressions by 
0 (275+4 22S5+3 4541] /45+1 as this is close to rounds, ;(/), and the same for F, where 
gı is the function from Corollary 3.7. We could also replace m by m + 2S + 4 to 
guarantee that 2~” < 4~©+?), We get the following important improvement of the 
previous lemma: 
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Lemma 4.2 We can construct some function Next in LDL’ that simulates one step of 

M, i.e. that computes the Next function sending a configuration C of Turing machine 
= : Z S/ 

M to C, where C’ is the next one: ||Next(2,2°,C) — C || < 27”. 


Furthermore, it is robust to errors on its input, up to space S: considering ||C — C|| < 
4-S+2)_ || Next(2™, 25, Č) — C'|| < 2-” remains true. 


Proposition 4.3 Consider some Turing machine M that computes some function 
f: &* — &* in some time T(€(w)) on input w. One can construct some function 
J :N? xR —> R in LDL? that does the same: f(2”,27@™), Yword(w)) that is at most 
2~™ far from Yword(f(w)). 


Proof The idea is to define the function Exec that maps some time 2’ and some initial 
configuration C to the configuration at time t. This can be obtained using previous 
lemmas by 


Exec(2™,0,27,C) =C 
Exec(2”,2!+! 27 C) Next(2” 27 , Exec(2”, 2,2", C)). 


We can then get the value of the computation as Exec(2”, TCO) aE) Cinit) On 
input w, considering Cini = (Go, 9, Ywora(w)). By applying some projection, we get 
the following function f(2”,27,y) = 73}(Exec(2”, 27,27, (qo, 0, y))) that satisfies the 
property. 


Actually, in order to get FPSPACE, observe that we can also replace the linear length 
ODE by a linear ODE. 


Proposition 4.4 Consider some Turing machine M that computes some function 
f: &* — X* in some polynomial space S((w)) on input w. One can construct some 
function f : N? xR > R in RLD” that does the same: we have f (2”, 2°") , yyord(w)) 
that is at most 2~™ far from Ywora(f(w)). 


Proof The idea is the same, but not working with powers of 2, and with linear ODE: 
define the function Exec that maps some time ¢ and some initial configuration C to the 
configuration at time t. This can be obtained the using previous lemma by 


Exec(2”,0, 2°, C) =C 
Exec(2”,t +1,25, C) Next(2” 25, Exec(2”, t,2°, C)). 
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In order to claim this is a robust linear ODE, we need to state that Exec(2”, t, 2°, C) is 
polynomially numerically stable: but this holds, since to estimate this value at 2~” it is 
sufficient to work at precision 47-52) (independently of t, from the rounding). 


We can then get the value of the computation as Exec(2™, 25) 25) Cinit) On 
input w, considering Cini = (qo, 0, Yword(W)). By applying some projection, we get 
the following function f(2”, 25, y) = 13(Exec(2™, S,25, (qo, 0, y))) that satisfies the 
property. 


5 Converting integers and dyadics to words, and conversely 


One point of our simulations of Turing machines is that they work over Z, through 
encoding Yword, While we would like to talk about integers and real numbers: we need 
to be able to convert an integer (more generally a dyadic) into some encoding over Z 
and conversely. 


Fix the following encoding: every digit in the binary expansion of d is encoded by a 
pair of symbols in the radix 4 expansion of d € ZN [0, 1]: digit 0 (respectively: 1) is 
encoded by 11 (resp. 13) if before the “decimal” point in d, and digit O (respectively: 
1) is encoded by 31 (resp. 33) if after. For example, for d = 101.1 in base 2, 
d = 0.13111333 in base 4. 


Lemma 5.1 (From N to Z) We can construct some function Decode : Ñ? —> R in 
LDL” that maps m and n to some point at distance less than 2~™ from Ywora(). 


Proof Recall the functions provided by Corollaries 3.9 and 3.10. The idea is to iterate 
&(n) times function 


(+2(F1), (l2 +5)/4) whenever mod2(77) = 0 
(+2(F7), (2 +7)/4) whenever mod2(77) = 1. 


Fr, h) = { 
over (n, 0), and then projects on the second argument. 


This can be done in LDL” by considering F(2”, 2” , 71, l2) = C-send(2’”"*!, 
Or (=2(2"+1, 2M, FT), (h +5)/4), 
Ly (2(2"41, 2M, FT), (bh + 7)/AXCFT), and then 


Decode(2", n) = 13(G(2™* ©), 2°41 9 n,0)), 


with 
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GQ" 2" 22) 160): = (n,0) 
GO" OM 21 r D = FO" 2". GO” 2r, D). 


The global error will be at most 2T) x Hany <22”. 


This technique can be extended to consider decoding of tuples: there is a function 
Decode : N+! — R in LDL” that maps m and n to some point at distance less than 
2~” from Ywora (N), with n defined componentwise. 


Conversely, given d, we need a way to construct d. Actually, as we will need to avoid 
multiplications, we state that we can even do something stronger: given d, and (some 
bounded) A we can construct Ad. 


Lemma 5.2 (From Z to R, and multiplying in parallel) We can construct some 
function EncodeMul : N? x [0,1] x R > R in LDL” that maps m, 25, Yword(d) and 
(bounded) A to some real at distance at most 27™™ from àd, whenever d is of length 
less than S. 


Proof The idea is to do as in the proof of previous lemma, but considering 


(a(16rD, 2h + 0, A) whenever i(167;) = 5 
(a(167)), 2/2 + À, A) whenever i(167;) = 7 
(a(16F7), (l2 + 0)/2,) whenever i(1677) = 13 
(a(16F7), (l2 + A)/2, à) whenever i(167;) = 15 


F, h, A) = 


iterated S times over suitable approximation of the rounding rounds+1(Ywora(d), 0, A)), 
with o and € constructed as approximation of the integer and fractional part, as 


before. 


6 Proofs and applications 


Theorem 1.3 follows from point 1. of next Proposition for one inclusion, and previous 
simulation of Turing machines for the other. 


We start by stating the following (similar to the statement about LDL” in [3]): 


Proposition 6.1 (1) All functions of LDL? are computable (in the sense of com- 


putable analysis) in polynomial time. 
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(2) All functions of RLD are computable (in the sense of computable analysis) in 
polynomial space. 


Proof The proof consists in observing this holds for the basic functions and that 
composition preserves polynomial time (respectively: space) computability and also 
by linear length ODEs. This latter fact is established by computable analysis arguments, 
reasoning on some explicit formula giving the solution of linear length ODE. The proof 
is similar to the statement about LDL” in [3]. In order to be self-content, we repeat in 
Appendix C, how this is established in [3]. 


Regarding space, the main issue is the need to prove the schema given by Definition 
2.7 guarantees f is in FPSPACE when u, g, and h are. Assuming condition 1. of 
Definition 2.7 would not be sufficient: the problem is that f(x, y) may polynomially 
grow too fast or have a modulus function that would grow too fast. The point is, in 
Definition 2.7, we assumed f to be both bounded and satisfying 2., i.e. polynomial 
numerical robustness. With these hypotheses, it is sufficient to work with the preci- 
sion given by this robustness condition and these conditions guarantee the validity of 
computing with such approximated values. 


We now go to various applications of the proposition and of our toolbox. First, we 
state a characterisation of FPTIME for general functions, covering both the case of a 
function f : N? > R”, f : R? > R” asa special case: only the first type (sequences) 
was covered by [3]. 


Theorem 6.2 (Theorem 1.4) A function f : R? x N” > R” is computable in 
polynomial time iff there exists f : R? x NI” +2 > R € LDL’ such that for all 
x € Rf, X €N, x € [-2*,2*], me N”, n € N, ||f(x, m, 2*, 2”) — f(x, m)|| < 27”. 


The reverse implication of Theorem 6.2 follows from Proposition 6.1, (1.) and argu- 
ments from computable analysis. 


Proof of reverse implication of Theorem 6.2 Assume there exists  : R¢x N?” +? — 
R” € LDL? such that for all x € Rf, X € N, x € [-2%,2*], me Nn EN, 
|f, m, a 2") _ f(x, m)|| < 2%, 


From Proposition 6.1, (1.), we know that f is computable in polynomial time (in the 
binary length of its arguments). Then f(x,m) is computable: indeed, given x, m and 
n, we can approximate f(x, m) at precision 2~” on [—2*, 2*] as follows: approximate 
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f(x,m,2*,2”+!) at precision 2~*+!) by some rational q, and output q. We will then 
have 
la ~~ f(x, m)|| < la = f(x, m, 2 at) || + ee m, 2, 2) > f(x, m)]|| 
< 27041) 4 2-01) 
<2. 


All of this is done in polynomial time in n and the size of m, and hence we get that f 
is polynomial time computable from definitions. oO 


For the direct implication, for sequences, that is to say, functions of type f : N” > R” 
(i.e. d = 0, the case considered in [3]) we are almost done: reasoning componentwise, 
we only need to consider f : N?” — R (i.e. d'= 1). As the function is polynomial 
time computable, this means that there is a polynomial time computable function 
g: NEH _, {1,3}* so that on m, 2”, it provides the encoding (m,n) of some 
dyadic ¢(m,n) with ||(m,n) — f(m)|| < 27” for all m. The problem is then to 
decode, compute and encode the result to produce this dyadic, using our previous 
toolbox. 


More precisely, from Proposition 4.3, we get % with 
|82, POD, Decode(2°, m, n)) — YWwora(g(m, n))| < 27° 


for some polynomial p corresponding to the time required to compute g, and e = 
max(p(max(m, n)), n). Then we need to transform the value to the correct dyadic: we 
mean 

f(m, n) = EncodeMul(2°, 2', 8(2°, 2', Decode(2°,m, n)), 1), 


where t = p(max(m,n)), e = max(p(max(m,n)),n) provides a solution such that 
fm, 2”) — f(m)|| < 2™”. 


Remark 13 This is basically what is done in [3], except that we do it here with 
analytic functions. However, as already observed in [3], this cannot be done for the 
case d > 1, i.e. for example for f : R — R. The problem is that we used the fact that 
we can decode: Decode maps an integer n to its encoding 7 (but is not guaranteed to 
do something valid on non-integers). There cannot exist such functions that would be 
valid over all reals, as such functions must be continuous, and there is no way to map 
continuously real numbers to finite words. This is where the approach of the article [3] 
is stuck. 


To solve this, we use an adaptive barycentric technique. For simplicity, and pedagogy, 
we discuss only the case of a polynomial time computable function f : R x N > R. 
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From standard arguments from computable analysis (see e.g. [Corollary 2.21][20]), 
the following holds and the point is to be able to realise all this with functions from 
LDL”. 


Lemma 6.3 Assume f : R x N — R is computable in polynomial time. There exists 
some polynomial m : N* — N and some f : Nt —> Z computable in polynomial 
time such that for all x € R, ||2~"f((2"™™x| , u, 2", 2") — f(x, u)|| < 27” whenever 
satay © [—2@,2™]. 


Assume we consider an approximation o; (with either i = 1 or i = 2) of the in- 
teger part function given by Lemma 3.7. Then, given n,M, when 2x falls in 
some suitable interval J; for o; (see the statement of Lemma 3.7), we are sure that 
oi(2°, 2"M+X+1 gm(n.M) x) is at some distance upon control from |2’""“)x|. Conse- 
quently, 2-"f(aj(2M™-M X41 gM) X) 4,2”, 2”) provides some 2—” -approximation 
of f(x,u), up to some error upon control. When this holds, we then use an ar- 
gument similar to what we describe for sequences: using functions from LDL”, 
we can decode, compute, and encode the result to provide this dyadic. It is pro- 
vided by an expression Formulaj(x,u,M,n) of the form EncodeMul(2°,2',f(27,2', 
Decode(2°, o;(2°,2™ , 27) x))), 27”). 


The problem is that it might also be the case that 2’"""x falls in the complement of 
the intervals (/;);. In that case, we have no clear idea of what could be the value of 
oi(2°, 2"0M)+X+1 Qm(n.M) x) and consequently of what might be the value of the above 
expression Formula;(x,u, M,n). But the point is that when it happens for an x for o1, 
we could have used o2, and this would work, as one can check that the intervals of type 
I, cover the complements of the intervals of type J and conversely. They also overlap, 
but when x is both in some J; and h, Formula,(x,u,M,n) and Formulaz(x,u,M,n) 
may differ, but they are both 2~” approximations of f(x). 


The key is to compute some suitable "adaptive" barycenter, using function À, provided 
by Corollary 3.8. Writing ~ for the fact that two values are closed up to some controlled 
bounded error, observe from the statements of Corollary 3.8 and 3.7 


e that whenever X(-,2”,x) © 0, we know that 02(-,2”,x) ~ |x]; 
e that whenever X(-,2”,x) © 1 we know that o1(-,2”,x) © |x]; 
e that whenever X(-,2”,x) € (0,1), we know that o1(-,2",x) ~ |x| +1 and 
o2(-,2",x) ~ [x]. 
That means that if we consider 


AC, 2”, x)Formula,(x,u,M,n) + (1 — AG, 2”, n))Formulaz(x, u,M,n) 
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we are sure to be close (up to some bounded error) to some 2~” approximation of 
f(x). There remains that this requires some multiplication with A. But from the form 
of Formula;(x,u,M,n), this could be also be written as follows, ending the proof of 
Theorem 6.2. 
(6-1) 
EncodeMul(2°, 2', f (2°, 2', Decode(2°, o (2°, 2” , 2") xy), (2°, 2M gin M) yya) 


EncodeMul(2°, 2', f(2°,2', Decode(2°, o(2°, 2” , 2") x), (1 (2%, 2M, 2M) xy") 


Proof of Theorem 1.3 We know that a function f : R? > R” from LDL? is poly- 
nomial time computable by Proposition 6.1, (1.). That means that we can approximate 
it with arbitrary precision, in particular precision l in polynomial time. Given such an 
approximation q, if we know it is some integer, it is easy to determine which integer 
it is: return (componentwise) the closest integer to q. 


Conversely, if we have a function f : N? > N” that is polynomial time computable, 
our previous simulations of Turing machines provide a function in LDL’? that computes 
it at any required precision, in particular 1/4. 


From the fact that we have the reverse direction in Theorem 6.2, it is natural to consider 
the operation that maps f to f. Namely, we introduce the operation ELim (this stands 
for Effective Limit): 


Definition 6.4 (Operation ELim) Given f : R? x N” x N > R” € LDL? such 
that for all x € R, X € N, x € |-2¥,2¥], m € N”, n € N, |[f(x,m,2*,2”) — 
f(x, m)|| < 27”, then ELim(f) is the (clearly uniquely defined) corresponding function 
f: R> R”. 


Theorem 6.5 A continuous function f is computable in polynomial time if and only 
if all its components belong to LDL? , where 


LDL? = [0,1, a, L(x), +, —, cond(x), - 5; composition, linear length ODE, ELim]. 


For the reverse direction, by induction, the only thing to prove is that the class of 
functions from to integers computable in polynomial time is preserved by the opera- 
tion ELim. Take such a function f. By definition, given x,m, X we can compute 
f(x,m,2¥,2”) with precision 27” in time polynomial in n. This must be, by defini- 
tion of ELim schema, some approximation of f(x, m) over [—2*, 2*], and hence f is 
computable in polynomial time. This also gives directly Theorem 1.4 as a corollary. 


From the proofs, we also get a normal form theorem, namely formula (6-1). In 
particular, 
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Theorem 6.6 Any function f : N¢ x R?” — R can be obtained from the class LDL? 
using only one schema ELim. 


We obtain the statements for polynomial space computability (Theorems 1.5 and 1.6) 
replacing LDL? by RLD”, using similar reasoning about space instead of time, con- 
sidering point 2. instead of 1. of Proposition 6.1, and Proposition 4.4 instead of 
Proposition 4.3. We now comment on the relations with formal neural network 
models. In this article, we are programming with tanh and sigmoids. We expressed 
the sigmoids in terms of the ReLU function, through Lemma 3.1. Function tanh could 
be replaced by arctan: the key was to be able to approximate the ReLU function with 
tanh (Lemma 3.1) and this can be proved to hold also for arctan, using error bounds 
on arctan established in [18]. 


. . $ . . 
Now, given some function f : R? > R” , some error and some time t, our expressions 
8 p 
provide explicit expressions in LDL?” of an approximation of what is computed by a 
Turing machine at time t uniformly over any compact domain. 


Remark 14 The formula 6-1 can be seen as a function that generates uniformly a 
family of circuits/formal C-s approximating a given function at some given precision 
over some given domain. The functions we generate are the composition of essentially 
linear functions, which can be considered as layers of formal neural networks! . 


References 


[1] Stephen Bellantoni and Stephen Cook. A new recursion-theoretic characterization of 
the poly-time functions. Computational Complexity, 2:97—110, 1992. 


[2] Manon Blanc and Olivier Bournez. A characterization of polynomial time computable 
functions from the integers to the reals using discrete ordinary differential equations. 
Submitted. Journal version of [3]. Preliminary version available on https: //arxiv. 
org/abs/2209. 13599. 


[3] Manon Blanc and Olivier Bournez. A characterization of polynomial time computable 
functions from the integers to the reals using discrete ordinary differential equations. 
In Jérôme Durand-Lose and György Vaszil, editors, Machines, Computations, and 
Universality - 9th International Conference, MCU 2022, Debrecen, Hungary, August 
31 - September 2, 2022, Proceedings, volume 13419 of Lecture Notes in Computer 
Science, pages 58-74. Springer, 2022. MCU’22 Best Student Paper Award. doi: 
16.1007 /978-3-031-13502-6\_4. 


With a concept of neural network that is not assuming that the last layer of the network is 
made of neurons, and that result may be outputted by some linear combination of the neurons 
in the last layer. 


FPTIME and FPSPACE over the reals characterised with discrete ODEs 33 


[4] 


Manon Blanc and Olivier Bournez. A characterisation of functions computable in 
polynomial time and space over the reals with discrete ordinary differential equa- 
tionssimulation of turing machines with analytic discrete odes. In Mathematical Foun- 
dations of Computer Science (MFCS’2023), 2023. 


Manon Blanc and Olivier Bournez. Measuring robustness of dynamical systems. 
relating time and space to length and precision, 2023. URL: https: //arxiv.org/ 
abs/2301.12723, doi:10.48550/ARXIV.2301.12723. 


Olivier Bournez. Programming with ordinary differential equations: Some first steps 
towards a programming language. In Ulrich Berger, Johanna N. Y. Franklin, Florin 
Manea, and Arno Pauly, editors, Revolutions and Revelations in Computability - 18th 
Conference on Computability in Europe, CiE 2022, Swansea, UK, July 11-15, 2022, 
Proceedings, volume 13359 of Lecture Notes in Computer Science, pages 39-51. 
Springer, 2022. doi: 10.1007/978-3-031-08740-0\_4. 


Olivier Bournez, Manuel L. Campagnolo, Daniel Graga, and Emmanuel S. Hainry. 
Polynomial differential equations compute all real computable functions on computable 
compact intervals. Journal of Complexity, 23(3):317-335, 2007. 


Olivier Bournez and Arnaud Durand. Recursion schemes, discrete differential equations 
and characterization of polynomial time computation. In Peter Rossmanith, Pinar 
Heggernes, and Joost-Pieter Katoen, editors, 44th Int Symposium on Mathematical 
Foundations of Computer Science, MFCS, volume 138 of LIPIcs, pages 23:1—23:14. 
Schloss Dagstuhl - Leibniz-Zentrum fiir Informatik, 2019. 


Olivier Bournez and Arnaud Durand. A characterization of functions over the in- 
tegers computable in polynomial time using discrete ordinary differential equations. 
Computational Complexity, 32(2):7, 2023. 


Olivier Bournez and Amaury Pouly. A universal ordinary differential equation. In 
International Colloquium on Automata Language Programming, ICALP’2017, 2017. 


Olivier Bournez and Amaury Pouly. A survey on analog models of computation. In 
Vasco Brattka and Peter Hertling, editors, Handbook of Computability and Complexity 
in Analysis. Springer, 2020. 

Vasco Brattka. Recursive characterization of computable real-valued functions and 
relations. Theoretical Computer Science, 162(1):45—77, 1996. 


Vasco Brattka, Peter Hertling, and Klaus Weihrauch. A tutorial on computable analysis. 
In New computational paradigms, pages 425—491. Springer, 2008. 

P. Clote. Computational models and function algebras. In Edward R. Griffor, editor, 
Handbook of Computability Theory, pages 589-68 1. North-Holland, Amsterdam, 1998. 
Peter Clote and Evangelos Kranakis. Boolean functions and computation models. 
Springer Science & Business Media, 2013. 


Alan Cobham. The intrinsic computational difficulty of functions. In Y. Bar-Hillel, 
editor, Proceedings of the International Conference on Logic, Methodology, and Phi- 
losophy of Science, pages 24—30. North-Holland, Amsterdam, 1962. 


34 


[17] 


[23] 


[24] 


[30] 


M. Blanc and O. Bournez 


Francois Fages, Guillaume Le Guludec, Olivier Bournez, and Amaury Pouly. Strong 
turing completeness of continuous chemical reaction networks and compilation of 
mixed analog-digital programs. In Computational Methods in Systems Biology-CMSB 
2017, 2017. CMSB’2017 Best Paper Award. 


Daniel S. Graça. Computability with Polynomial Differential Equations. PhD thesis, 
Instituto Superior Técnico, 2007. 


Daniel S. Graça and Ning Zhong. Handbook of Computability and Complexity in 
Analysis, chapter Computability of Differential Equations. Springer., 2018. 


Ker-I Ko. Complexity Theory of Real Functions. Progress in Theoretical Computer 
Science. Birkhaiiser, Boston, 1991. 


D. Leivant. Intrinsic theories and computational complexity. In LCC’94, number 960 
in Lecture Notes in Computer Science, pages 177-194, 1995. 


Daniel Leivant. Predicative recurrence and computational complexity I: Word recur- 
rence and poly-time. In Peter Clote and Jeffery Remmel, editors, Feasible Mathematics 
IT, pages 320-343. Birkhauser, 1994. 


Daniel Leivant and Jean-Yves Marion. Lambda calculus characterizations of Poly- 
Time. Fundamenta Informatica, 19(1,2):167,184, 1993. 


Daniel Leivant and Jean-Yves Marion. Ramified recurrence and computational com- 
plexity II: substitution and poly-space. In L. Pacholski and J. Tiuryn, editors, Computer 
Science Logic, 8th Workshop, CSL’94, volume 933 of Lecture Notes in Computer Sci- 
ence, pages 369-380, Kazimierz, Poland, 1995. Springer. 


Keng Meng Ng, Nazanin R Tavana, and Yue Yang. A recursion theoretic foundation of 
computation over real numbers. Journal of Logic and Computation, 31(7):1660-1689, 
2021. 


Marian Boykan Pour-El and J. Ian Richards. A computable ordinary differential equa- 
tion which possesses no computable solution. Annals of Mathematical Logic, 17:61-90, 
1979. 


Hava T. Siegelmann. Neural Networks and Analog Computation - Beyond the Turing 
Limit. Birkauser, 1999. 


Hava T. Siegelmann and Eduardo D. Sontag. On the computational power of neural 
nets. Journal of Computer and System Sciences, 50(1):132—150, February 1995. 


David B Thompson. Subrecursiveness: Machine-independent notions of computability 
in restricted time and storage. Mathematical Systems Theory, 6(1-2):3-15, 1972. 


Klaus Weihrauch. Computable Analysis: an Introduction. Springer, 2000. 


FPTIME and FPSPACE over the reals characterised with discrete ODEs 35 


A Concepts from computable analysis 


When we say that a function f : $4 X- +- X S4 > R” is (respectively: polynomial-time) 
computable this will always be in the sense of computable analysis: see e.g. [13, 30]. 
We recall here the basic concepts and definitions, mostly following the book [20], 
whose subject is complexity theory in computable analysis. This section is repeating 
the formalisation proposed in [3] done to mix complexity issues dealing with integer 
and real arguments: a dyadic number d is a rational number with a finite binary 
expansion. That is to say d = m/2” for some integers m € Z, n € N, n > 0. Let D be 
the set of all dyadic rational numbers. We denote by D, the set of all dyadic rationals 
d with a representation s of precision prec(s) = n; that is, D, = {m - 2" | m E€ Z}. 


Definition A.1 ({20]) For each real number x, a function ¢ : N — D is said to 
binary converge to x if for all n € N, prec(¢(n)) = n and |¢(n) — x| < 2”. Let CF, 
(Cauchy function) denotes the set of all functions binary converging to x. 


Intuitively, a Turing machine M computes a real function f the following way: 1. 
The input x to f, represented by some ọ € CF», is given to M as an oracle; 2. The 
output precision 2~” is given in the form of integer n as the input to M; 3. The 
computation of M usually takes two steps, though sometimes these two steps may be 
repeated an indefinite number of times; 4. M computes, from the output precision 
27”, the required input precision 2~”; 5. M queries the oracle to get (m), such that 
||o@n) — x|| < 2~”, and computes from ¢(m) an output d € D with ||d —f(x)|| < 
2~". More formally: 


Definition A.2 ((20]) A real function f : R —> R is computable if there is a function- 
oracle TM M such that for each x € R and each ¢ € CF,, the function Y% computed 
by M with oracle @ (i.e., wn) = M*(n)) is in CFfœ. We say the function f is 
computable on the interval [a,b] if the above condition holds for all x € [a, b]. 


Remark 15 Given some x € R, such an oracle TM M can determine some integer X 
such that x € [—2*,2*]. 


A.1 On computable analysis: Complexity 


Assume that M is an oracle machine computing f on a domainG. For any oracle 
o € CF,, with x € G, let Ty(¢,n) be the number of steps for M to halt on input n 
with oracle ¢, and Ty(x,n) = max {Ty(¢,n) | $ E€ CFx}. The time complexity of f 
is defined as follows: 
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Definition A.3 ([20]) Let G be bounded closed interval [a,b]. Let f : G — R bea 
computable function. Then, we say that the time complexity of f on G is bounded by 
a function t: G x N > N if there exists an oracle TM M which computes f such that 
for all x € G and all n > 0, Ty(x,n) < t(x,n). 


In other words, the idea is to measure the time complexity of a real function based on 
two parameters: input real number x and output precision 2~”. Sometimes, it is more 
convenient to simplify the complexity measure to be based on only one parameter, the 
output precision. For this purpose, we say the uniform time complexity of f on G is 
bounded by a function ¢’ : N — N if the time complexity of f on G is bounded by a 
function t : G x N > N with the property that for all x € G, t(x,n) < ¢(n). 


However, if we do so, it is important to realise that if we had taken G = R in the 
previous definition, for unbounded functions f, the uniform time complexity would 
not have existed, because the number of moves required to write down the integral part 
of f(x) grows as x approaches +00 or —oo. Therefore, the approach of [20] is to do as 
follows (The bounds —2* and 2* are somewhat arbitrary, but are chosen here because 
the binary expansion of any x € (—2”, 2”) has at most n bits in the integral part). 


Definition A.4 (Adapted from [20]) For functions f(x) whose domain is R, we say 
that the (non-uniform) time complexity of f is bounded by a function ” : N? —> N if 
the time complexity of f on [-2* ; 2*] is bounded by a function t : N? > N such that 
t(x,n) < t(X,n) for all x € [—2*,2*]. 


Remark 16 The space complexity of a real function is defined similarly. We say 
the space complexity of f : G —> R is bounded by a function s : Gx N > N if 
there is an oracle TM M which computes f such that for any input n and any oracle 
$ € CF,,M%(n) uses < s(x, n) cells, and the uniform space complexity of f is bounded 
by s’: N > N if for all x € G and all ¢ € CFy,M%(n) uses < s'(n) cells. All coming 
definitions can then be easily extended to talk about space complexity. 


As we want to talk about general functions in F, we extend the approach to more 
general functions. (for conciseness, when x = (x1,...,Xp), X = (X1, ..., Xp), we 
write x € [-2¥, 2X] as a shortcut for xı € [-2*1 i 2i jei p E [-2%, 2%] ). 


Definition A.5 (Complexity for real functions: general case) Consider a func- 
tion f(x1,...,Xp, M1,..-,M,) whose domain is RP x N1. We say that the (non- 
uniform) time complexity of f is bounded by a function ? : NPtd+! — N if 
the time complexity of f(-,...,-,@(m1),...,€(mg)) on [—2%1, 2%] ae [-2*?, 2 | 
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is bounded by a function ¢(-,...,-,€(m1),...,4(mg),-) : NP x N — N such that 
t(x, £(n1),..., £(g),n) < rX, &(n1),..., (ng), n) whenever x € [—2*,2*] . We say 
that f is polynomial time computable if ¢’ can be chosen as a polynomial. We say that 
a vectorial function is polynomial time computable iff all its components are. 


Remark 17 There is some important subtlety: when considering f : N —> Q, as 
Q CR, stating f is computable may mean two things: in the classical sense, given 
integer y, i.e. one can compute py and gy some integers such that f(y) = py/qy, or 
that it is computable in the sense of computable analysis: given some precision n, 
given arbitrary y, and n we can provide some rational (or even dyadic) g, such that 
lan —f(y)| < 27”. As we said, we always consider the latter. 


We do that so this measure of complexity extends the usual complexity measure for 
functions over the integers, where complexity of integers is measured with respects 
of their lengths, and over the reals, where complexity is measured with respect to 
their approximation. In particular, in the specific case of a function f : N? > R, 
that basically means there is some polynomial ¢’ : N¢+! — N so that the time 
complexity of producing some dyadic approximating f(m) at precision 27” is bounded 
by r (&(m:),..., &ma), n). 


In other words, when considering that a function is polynomial time computable, it is 
in the length of all its integer arguments, as this is the usual convention. 


B Some results from [8, 9, 3] 


B.1 Some general statements from [8, 9] 


To be self-content, we recall in this section some results and concepts from [8, 9]. 
All the statements in this section are already present in [8, 9]: we are just repeating 
them here in case this helps. We provide some of the proofs when they are not in the 
preliminary ArXiv version. 


As said in the introduction: 


Definition B.1 (Discrete Derivative) The discrete derivative of f(x) is defined as 
Af(x) = f(x+1)—f(x). We will also write f’ for Af(x) to help readers not familiar with 
discrete differences to understand statements with respect to their classical continuous 
counterparts. 
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Several results from classical derivatives generalise to the settings of discrete differ- 
ences: this includes linearity of derivation (a - f(x) +b - g(x))’ =a-f'(x) +b- g'(x), 
formulas for products and division such as (f(x): g(x))! = F'O g(x + 1)+f(x)-9’(x) = 
f(xt+ DL (x) +f'(x) g(x). Notice that, however, there is no simple equivalent of the chain 
rule, in other words, there is no simple formula for the derivative of the composition 
of two functions. 


A fundamental concept is the following: 


Definition B.2 (Discrete Integral) Given some function f(x), we write 


b 
1 f(x)ôx 


as a synonym for a f(x)ôx = ae f(x) with the convention that it takes value 0 
when a = b and f? f(x)dx = — fy £(x)dx when a > b. 


The telescope formula yields the so-called Fundamental Theorem of Finite Calculus: 


Theorem B.3 (Fundamental Theorem of Finite Calculus) Let F(x) be some function. 
Then, 

b 
f F’'(x)dx = F(b) — F(a). 


a 


A classical concept in discrete calculus is the one of falling power defined as 


xt =x (x 1) (x2) (x—(m-— 1). 


This notion is motivated by the fact that it satisfies a derivative formula (x)! = m- x”=} 


similar to the classical one for powers in the continuous setting. In a similar spirit, we 
introduce the concept of falling exponential. 


Definition B.4 (Falling exponential) Given some function U(x), the expression U 
to the falling exponential x, denoted by one stands for 
2 = (+U@-1)---+U)-0 + U0) 
t=x—1 
= |[a+v©@), 
t=0 

with the convention that Ti =] ' — id, where id is the identity (sometimes denoted 
1 hereafter). 
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This is motivated by the remarks that 2* = 2", and that the discrete derivative of a 
falling exponential is given by 


a”). =U@ 2 


for all x € N. 


Lemma B.5 (Derivation of an integral with parameters) Consider 


b(x) 
F(x) = | f(x, ôt. 


x) 


Then 


b(x) of —a'(x) b'(x) 
Fœ = ih Lant | fx+ 1,a&x +1) + Ast +4 f f(x + 1, b(x) + Hót. 
a(x) Ox 0 0 


In particular, when a(x) = a and b(x) = b are constant functions, F' (x) = I. Hix, tot, 
and when a(x) = a and b(x) = x, F(x) = f Hx, tot + f(x + 1,x). 


Proof 
F(x) = F+ 1)-— F% 
b(x+1)—1 b(x)—1 
= XŠ te+lo- SO tao 
t=a(x+1) t=a(x) 


b(x)—1 t=a(x)—1 b(x+1)—1 


= XO (f+ 1,1) — f(x, t) + > fx LO + 5 f(x + 1,1) 


t=a(x) t=a(x+1) t=b(x) 
b(x)—1 a t=a(x)—1 b(x+1)—1 

= = Od + ` f(x+1,) + yy f(x + 1,1) 
t=a(x) t=a(x+1) t=b(x) 
b(x)—1 a t=—a(x+1)+a(x)—-1 

3N zet y f(x+lax+)+ 
t=a(x) t=0 


b(x+1)—b@œ)—1 


4 5 f(x + 1,b(x) +2). 


t=0 


Lemma B.6 (Solution of linear ODE) For matrices A and vectors B and G, the 
solution of equation f'(x, y) = A(f(x, y), h(x, y), x, y) - f(x, y) + B€, y), h(x, y), x, y) 
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with initial conditions f(0, y) = G(y) is 


f(x,y) = ee Gly) 


xX x 
+ f (D APRON) Bee, y), hlu, y), u, yu 
0 


Proof Denoting the right-hand side by rhs(x, y), we have 


rhs'(x,y) = ACG, y), h(x, y) x, y) (2P OPN"). Gey) 


= f>., A@(t,y),h ot\! 
sft S ion (f@y),h@y),t,y) ‘) . B(f(u, y), h(u, y), u, you 
7 (a A(f(t,y),h(C,y),t,y)5t 


) ` BG, y), h(x, y), X, y) 
= A, y), b, y), x, y) (ZP AED. Gy) 
=F ACE, y), h(x, y), X, y) 
fi (P AEDADE) Btu, y), hlu, y), u, yu 
+ B@&, y), h(, y), x, y) 
= A(f(x, y), h(x, y), x,y); rhs(x, y) + BEG, y), h(x, y), x,y) 
where we have used linearity of derivation and definition of falling exponential for the 
first term, and derivation of an integral (Lemma B.5) providing the other terms to get 


the first equality, and then the definition of falling exponential. This proves the property 
by unicity of solutions of a discrete ODE, observing that rhs(0, y) = G(y). 


We write also 1 for the identity. 


Remark 18 Notice that this can be rewritten as 


x—1 x-1 
(B-1) f@y)= > ( II d Aayah) ` BÆ(u, y), hu, y), u, y), 


t=u+1 


u=—1 


with the (not so usual) conventions that for any function K(-), 0 a K(x) = 1 and 
B(—1,y) = G(y). Such equivalent expressions both have a clear computational 
content. They can be interpreted as an algorithm unrolling the computation of f(x+1, y) 
from the computation of f(x, y), f(x — 1, y),..., f(0,y). 


A fundamental fact is that the derivation with respects to length provides a way to soa 
kind of change of variables. This is Lemma 2.5 in the body of this article. 
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C Some statements from [3] 


We repeat here some statements from [3]. They are motivated by repeating the argu- 
ments for proving Proposition 6.1, 1. 


C.1 On the complexity of solving a linear length ODE 


We have to prove that all functions of LDL’? are computable (in the sense of computable 
analysis) in polynomial time. The hardest part is to prove that the class of polynomial 
time computable functions is preserved by the linear length ODE schema, so we start 
by it. 


Lemma C.1 The class of polynomial time computable functions is preserved by the 
linear length ODE schema. 


We write |||: - -||| forthe sup norm of integer part: given some matrix A = (Aj,j)1<i<n,1<j<m> 
||| Al|| = max; ;[Ai;]. In particular, given a vector x, it can be seen as a matrix with 
m = 1, and |||x||| is the sup norm of the integer part of its components. 


Before proving this lemma, we prove the following result: 


Lemma C.2 (Fundamental observation) Consider the ODE 
(C-1) F'x, y) = A(F(x, y), h&, y), x, y) - Fx, y) + BORG, y), h, y), x, y). 
Assume: 


e The initial condition G(y) =f F(O, y) is polynomial time computable, and 
h(x, y) are polynomial time computable with respect to the value of x. 


e A(F(x, y), h(x, y),x,y) and B(F(x, y), h(x, y),x, y) are cond-polynomial expres- 
sions essentially constant in F(x, y). 


Then, there exists a polynomial p such that «(|E (x, y)|||) < p (x, €(|ly||)) and F(, y) 
is polynomial time computable with respect to the value of x. 


Proof The solution of ordinary differential equation (C—1) can be put in some explicit 
form (this follows from [8, 9], or alternatively Remark 18 following Lemma B.6 in 
appendix, or, if you prefer, can be directly established by induction, using the recurrence 
formula (C-1)). 
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x—l1 x-1 
(C-2) Fix,y) = >> ( Jfa + ACRE yhaa] -BF(u, y), h(u, y), u, y). 


u=—] \t=u+1 


with the conventions that p! K(x) = 1 and B(,—1,y) = G(y). 


This formula permits to evaluate F(x,y), using a dynamic programming approach, 
from the quantities y, u, h(u,y), for 1 < u < x, in a number of arithmetic steps 
that is polynomial in x. Indeed: for any —1 < u < x, A(F(u,y), h(u,y),u,y) and 
B(F(u, y), h(u, y), u, y) are matrices whose coefficients are cond-polynomial. Their 
coefficients involve finitely many arithmetic operations or cond() operations from 
their inputs. Once this is done, computing F(x, y) requires polynomially in x many 
arithmetic operations: basically, once the values for A and B are known we have to 
sum up x + 1 terms, each of them involving at most x — 1 multiplications. 


We need to take care not only on the arithmetic complexity that is polynomial in x, 
but also on the bit complexity. We start by discussing the bit complexity of the integer 
parts. Each of the quantities y, u, h(u,y), for 1 < u < x, being polynomial time 
computable, has its integer part with a bit complexity that remains polynomial in x 
and £(|||y|||). As the bit complexity of a sum, product, etc is polynomial in the size of 
its arguments, it is sufficient to show that the growth rate of function F(x,y) can be 
polynomially dominated. For this, recall that, for any —1 < u < x, coefficients of 
A(F(u, y), hu, y), u, y) and B(F(u, y), h(u, y), u, y) are essentially constant in F(u, y). 
Hence, the size of the integer part of these coefficients do not depend on @(F(u, y)). 
Since, in addition, h is computable in polynomial time in x and &(y), there exists a 
polynomial py such that: 


(C-3) 
max(¢(\|A(F(u, y), hu, y), u, DID, KIB ŒE, y), hu, y), u, DI < puu, «liy. 
It then holds that, 
KIEG +1, PID < pu, Kyl + AEK, pI + 2 


It follows from an easy induction that we must have 


KEC, DID < AEI +x- Pm, Kyl + 2), 


which gives the desired bound on the length of the integer part of the values for function 
F, after observing that, since G is polynomial time computable, necessarily ¢((\|G(y)|||) 
remains polynomial in ¢((|ly|||). 
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We now take care of the bit complexity of involved quantities, in order to prove that 
F(x, y) is indeed polynomial time computable with respect to the value of x. Given 
y, we can determine some integer Y such that y € [2~’,2”]. We just need to prove 
that given n, we can provide some dyadic Zy approximating F(x, y) at precision n, i.e. 
with ||/F(x, y) — z,|| < 27”, in a time polynomial in x and Y. 


Basically, this follows from the possibility of evaluating F(x, y) using formula (C—2). 
This latter formula is made of a sum of x+ 1 terms, that we can call Tp,T),...,T,, each 
of them T; corresponding to a product of k matrices (or vectors) T; = Ci C2... Cko, 
with k(i) < x + 1, were each C; is either some B(F(u, y), h(u, y), u, y) for some u or 
some (1 + A(F(t, y), h(t, y),¢, y)) for some t. 


To solve our problem, it is sufficient to be able to approximate the value of each T; 
by some dyadic d; with precision 2~"~”", considering m with x + 1 < 2”. Indeed, 
taking Zy = )~*_, d; will guarantee an error on the approximation of F(x, y) less than 
(x + 127-7" <2. 


So we focus on the problem of estimating T; = C,Cz... Cy) with precision 27". 
If we write Č; for some approximation of Cj, we can write 


(C4) 
|CiC2 ee Cko — CG ie Čo) | < C,C.. Cyi-1 (Cko — Cui) 1 
+ |[CiCz... Cea—2 (Cko-1 — Čko-1) Čo) | 
+ IC1C2... Cra (Cro-2 — Čro-2) Čro-1Čo || 


t VG- a: 


We just need then to compute some approximation Cx of Cka), guaranteeing the first 
term in (C—4) to be less than 27”7™-™ , and then choose some approximation Čko-1 
of Cy), guaranteeing the second term in (C—4) to be less than 27”7™=™, and so on. 
Doing so, we will solve our problem, as CC, bes Čko will provide an estimation of 
T;, with an error less than (x + 1)2 7” "7" < 27" ™ by (C-4). 


For the first term of (C—4), the point is that we know from a reasoning similar to 
previous computations (namely bounds such as (C-3)) that we have 


||| C1 Cz see Cra || < P1% Y) 
for some polynomial pı. 


Consequently, it is sufficient to take [Cko — Čo || < 272m1% Y) to guarantee an 
error less than 27” ™”", 
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A similar analysis applies for the second, and other terms, that involve finitely many 
terms. The whole approach takes a time that remains clearly polynomial in x and 
Y. 


The previous statements lead to the following: 


Lemma C.3 (Intrinsic complexity of linear £-ODE) Let f be a solution of the linear 
L-ODE 


f0,.y) = gy), 
o, D = w(t(x,y),h(,y),x,9) 


where u is essentially linear in f(x,y). Assume that the functions u,g,h are com- 
putable in polynomial time. Then, f is computable in polynomial time. 


Proof From Lemma 2.5, f(x, y) can also be given by f(x,y) = F(¢(x), y) where F is 
the solution of the initial value problem 


F(l,y) = g(y), 


EY L wFy),2- Ly). 


Functions u are cond-polynomial expressions that are essentially linear in f(x, y). So 
there exist matrices A, B that are essentially constants in f(t, y) such that 


Of (x, y) 
Ol 


In other words, it holds 


F(t, y) = AFC, y), t,y) . F(t, y) + BFC, y), t, y). 


= A(f(x, y), h(x, y), x, y) - £x, y) + BUR, y), h(x, y), x, y). 


by setting 
A(F(t, y),t,y) = A(F(t, y), h(2' — 1,y),2’- l,y) 
BFC, y), t, y) — BFC, y), h(2’ — 1,y), 2” — 1,y) 


The corresponding matrix A and vector B are essentially constant in F(t,y). Also, 
functions g,h are computable in polynomial time, more precisely polynomial in f(x), 
hence in t, and £y). Given fr, obtaining 2f — 1 is immediate. This guarantees that 
all hypotheses of Lemma C.2 are true. We can then conclude remarking, again, that 


t = &(x). 


This proves Lemma C. 1. 
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C.2 Proof of Proposition 6.1, 1. 


We can now state and prove Proposition 6.1, 1: 


Proof of Proposition 6.1, 1. This is proved by induction. This is true for basis func- 
tions, from basic arguments from computable analysis. In particular as cond is a 
continuous piecewise affine function with rational coefficients, it is computable in 
polynomial time from standard arguments. 


Now, the class of polynomial time computable functions is preserved by composition. 
This is proved in [20]: in short, the idea of the proof for COMP(f,g), is that by 
induction hypothesis, there exists My and M, two Turing machines computing in 
polynomial time f : R — R and g : R —> R. In order to compute COMP(f, g)(x) with 
precision 27”, we just need to compute g(x) with a precision 2~”"”, where m(n) is the 
polynomial modulus function of f. Then, we compute f(g(x)), which, by definition 
of My takes a polynomial time in n. Thus, since polynomial time with an oracle in 
polynomial time is polynomial time, COMP(f, g) is computable in polynomial time, 
so the class of polynomial time computable functions is preserved under composition. 
It only remains to prove that the class of polynomial time computable functions is 


preserved by the linear length ODE schema: this is Lemma C.1. 


Institut Polytechnique de Paris, Ecole Polytechnique, LIX, Palaiseau, France 
Institut Polytechnique de Paris, Ecole Polytechnique, LIX, Palaiseau, France 


Manon.Blanc@lix.polytechnique.fr, Olivier.Bournez@polytechnique.fr 


https://perso.crans.org/mblanc/, https://www.1lix.polytechnique.fr/~bournez/ 


Received: aa bb 20YY Revised: cc dd 20ZZ 


